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Characterizations of Linear Sufficient Statistics 

By B. Charles Peters, Jrf, Richard Redner,! 
and Henry P. Decell, Jr.^ 

University of Houston 

We develop a necessary and sufficient condition that there exist 
a continous linear sufficient statistic T for a dominated col- 
lection of totally finite measures defined on the Borel field 
generated by the open sets of a Banach space X. In particular, 
corollary necessary and sufficient conditions that there exist a 
rank k linear sufficient statistic T for any finite collection of 
probability measures having n-variate normal densitites are given. 
In this case a simple calculation, involving only the population 
means and covariances, determines the smallest integer k for which 
there exists a rank k linear sufficient statistic T (as well as 
an associated statistic T Itself). 


^Thls author was partially supported by NASA Contract NAS-9-15000 
with the University of Houston during the preparation of this work. 
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REPRODUCIBnJTY OF THE 
PA S IS POOR 

!♦ Introduction . If W is a Hanach apace, 20 (W) will denote the Borel 
field generated by the open sets of W. The totally finite measures 
defined on iQ (W) will be denoted by 7?^ (W) , For )i,X E 7?^ (W) we will 
write U « X provided BE J0(W) and X(B) =0 implies p(B) =0. 

Whenever p « X, (dp/dX] will denote the equivalence class of Radon- 
Nikodym derivatives of p with respect to [2] [3]. It c ^(W) , ^p^^will 
be called a dominated (by X ) set of measures provided there exists 
X e?7^(W) (X not necessarily in ,0 ) such that p E implies 
p « X. We will call c ^(W) equivalent to X X) provided 

is dominated by X and p(B) = 0 for each p E implies X(B) = 0. 


If X and Y are Banach spaces and T:X -> Y then, following the notation 
in [3], we write f(e)T ^(C3(Y)) provided f;X ->• R (= Reals) and f Is 
(T ^(TB(Y) , ZB(R)) - measurable (as well as ( ©(X) , JlS(R) ) “ measurable). 

In [3], Halmos and Savage develop an approach to sufficient statistics. 
Their results provide an alternate definition, within a very general mathema- 
tical framework, of statistical sufficiency for dominated sets of measures. 
This alternate definition is particularly suitable to the development of the 
results in this paper. We will require the statement (Theorem 1.) of the 
alternate definition in the setting of Banach spaces. 

In all that follows X and Y will be Banach spaces, T a linear 
continuous mapping of X onto Y, and (X) a dominated set of 

measures. 

Theorem 1. (Halmos-Savage [ 3 ]) A necessary and sufficient condition that 
T be a sufficient statistic for is that there exist X e??7(X) such 
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that A and f. fclh/cJA] such that gjj(C)T ^(©(Y)) for each 

U Cc0. 

In this paper our particular concern will be that of developing 
necessary and sufficient conditions that a linear continuous mapping T 
of X onto Y be a sufficient statistic for a dominated set of measures 

In Theorem 2. we will require an additional condition on T which, to 

the best of our knowledge, is generally unavoidable . We will require 

that the kernel of T ( “ ker T) be complemented . in the sense that there 

exists a closed subspace S of X such that X = ker T ©S (e.g., if 

X is a Hilbert space, take S = (ker T)"^). 

In Theorem 4- we will show that the condition X = ker T ©S may be 

relaxed whenever [d)j/dA] contains a continuous representative. 

The res Its we develop are finally used to establish necessary and sufficient 

n Ic 

conditions that a linear statistic B:R R (k £ n) be sufficient for a 
finite collection of probability measures having n-variate normal densities. 

2, Principal Results . In all that follows we will assume that X and Y 
are Banach spaces, T:X -*■ Y is a linear continuous mapping of X 
onto Y, and is a dominated set of measures. 

Theorem 2. Let X = ker T © S for some closed subspace of X. A 
necessary and sufficient condition that T be a sufficient statistic for 
is that there exist A e?7j?(X) such that = A and, 

ker T c {y;g^(x + y) = g^(x), X e X} 

for each }J e oe' and some e [dyi/dA], 
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Proof. If T Is a sufficiunt statl.scic for <30^ and M e ^ then 
there exists (Theorem 1 X ^ 00 and G [dii/dX] such that (lB(T)* 

Suppose y g ker T and, without loss oE generality, there exists x^ g X 
such that g^(xQ + y) < Choose r G R such that g^(Xg + y) < r < B^CXq^' 

Since ^(-“,r) and g^ ^(r,«) are elements of (X) and g^(e)T ^(©(Y)) 
it follows that there exist and B^ c ®(Y) such that 

Xq + y e g"^(-“,r) = and Xq G g“^(r,“>) = t'^CB^). Now, since T 

is linear and y G ker T, T(Xq) g n = <J», which is absurd. 

Conversely, suppose O0 = X, pc ^ and ker T c {y:g^(x + y) = S^Cx), 

X G X} for some g G [dp/dX]. We need only show (according to Theorem 1/ 

that g^(€)T~^(fi(Y) . It will only be necessary to show that for r e R 

there exists B G ■© (Y) such that g ^(-“,r) ® T ^(B ). We will show 

r P r 

first that g^”^(-“,r) = t”^ T(g^~^(-“’, r) fi S) and then that 
B^ = T(g^"^(-“,r) n S) Gt0>(Y). 

If X e T"^(T(g ~^(-“,r) n S) then T(x) g T(g ^(-»,r) n S) and 

H H 

hence T(x) “ T(z) for some z G ^(-“,r) n S. Since T is linear 
X - z G ker T so that gjj(x) = - z -f z) = g^(z) < r and 

X G gjj"^(-“,r). 

; If X G g^ ^(-“,r) then, since X = ker T©S, x = k+ s for 

\ k G ker T and s G S, It follows that T(x) = T(s), s - x G ker T, 

gp(s) ” gy(s - X + x) = gjj(k) < r J S G ^(-®,r)j T(x) = T(s) g T(g^ ^(-”,r) n S) 

and, finally, that x c T ^(T(g ^(-“,r) n S)). 

i N 

We now show that T(g ^(-™,r) n S) G © (Y). Let T ;S ->■ Y be the 

f ^ 

! restriction of T to S and observe that T is a one to one continuous 

! ® 

i 
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REPRODUCIBILITY OP THE 
ORIGINAL PAGE IS POOR 

mapping of the Banaeh space S onto the Banach space Y. Since Tg 

satisfies the hypothesis of the open mapping theorem Tg is a 

homeomorphisra of S onto Y. Since such mappings take elements of t3 (S) 

into elements of (Y) and g is measurable, g ^(-”,r) n S £ £0 (X) n S =S(S). 

M M 

It follows that T(g, n S) ^ T„(e n S) £ 8 (Y) and the 

proof of the theorem is complete. 

Theorem 3. Let c0^ = X, X(B) = X(B - y) for each y e ker T and 
B ~ ^ ® non-empty open subset C of X 

and let [dp/dX] contain a continuous representative element for each 

M 

A necessary and sufficient condition that T be a sufficient statistic 
for is that 

ker T c {y ; f^(y + x) = ^ ^ 

Proof: In order to see that the condition is sufficient we need only show 

(according to Theorem 1.) that f^(e)T“^((8(Y)) , or equivalently, if r E R 

that f^"^(-“,r) = t""^(B^) for some e In fact, since T is an 

open mapping and f^ is continuous, T(f (~“,r)) G fi (Y) , We take 

B^ = T(f”^(-«>,r)) and conclude the argument by showing that 

f =: T~^T(f ~^(-",r)). We clearly need only establish that 

M 

T"^T(f "^(-“,r)) c f “^(- 00 , r). If X e T~^T(f ~^(-“,r)) then T(x) = T(z) 
y p H 

for some z £ f ^(-“,r). Since x - z E ker T it follows that 

y 

f (x) = f (x - z + z) = f. (z) < r and hence that x E f 
y ' ' y ' y M 
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In order to prove the necessity of the condition, lecail the proof of 
the necessity of the condition in Theorem 2. and observe that the hypothesis 
X = ker T © S for some closed subspace S of X was not essential. We 
may conclude that if p e there exists c [djJ/dX] such that 
ker T c {y ; g^(y + x) = 8^(x), x C X} and except on a set 

B e ^ (X) such that A(B) = 0. 

Fix y e ker T. Since {x : f^^(y -f x) g^(y + x)} = B - y and 
A(B - y) = A(B) = 0, we may conclude that except on 

C “ B u (B - y) r.nd A(C) = 0. Moreover, sii'ce the mapping x y + x 
is a homeomorphism of X onto X and f^ is continuous, C is an open 
subset of X. According to the hypothesis, X(C) = 0 and C open imply 
C is empty so that f,,(y + x) = -for each x e X. 

3. Normal Families . In what follows we will assume that <0 = 

is a family of m probability measures defined on (r") having normal 

densities 

p^(x) “ (27T) exp[- Y (x - - Hj^l; i = 0, 1, ... ,m-l. 

where and are known and is symmetric and positive definite. 

We will derive necessary and sufficient conditions that a k x n matrix B 
(k ^ n) mapping r'^ onto R*^ (i.e., rank (B) = k) be a sufficient 

statistic for first prove a Lemma. 

Lemma 1. If l^iSm-l and f^(x) = p^(x)/Pq(x) then 

ty ' f^Cy + x) “ X c x) = ker(W^^ - n ■ 
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Proof; Fix y e r". ACCcr a little matrix algebra (which we will omit) we 
find that + x) « each x e r" if and only if 

ax’^cn"^ - fi'^)y - - Ql\) + y^(J7“^ - ^hv = 0 

T —1 

for each x e R“. For x = -y/2 we see uliat y Hq) “0 so 

that y E - nQ^nQ}"*" • In addition, it follows that 

2x’^(n^^ - n"^)y + - «Q^)y “ 0 and, writing x = (z - y)/2, that 

z'^CSlj^^ “ = 0 for each z e X. This clearly implies “ 0 

so that y E ker(n^^ ** • -t-'® remaining containment follows easily. 

Theorem 4. A necessary and sufficient condition that a k x n rank k 
matrix B be a sufficient statistic for (F. that 

X X“U 

ker B c [ker(57^^ - n ’ 

Proof: Since the preliminary conditions of Theorem 3, are clearly satisfied 
for \ Pq, Lemma 1, ^nsures the necessity and sufficiency of the condition. 

Theorem 5. A necessary and sufficient condition, that a k x n rank k matrix B 

r .m-1 1 

be a sufficient statistic for that, for j = 1, ... , m - 1, 

T T “1 T T “1 

(a) (Bfl.B ) = (Bn_B ) 

j J 0 0 

(b) rij - Q^B'^(Bfi3'^)''^Bnj = Hq - fiQB'^(BnQB'’^)'^BnQ 

(c) n. - fl.B'^(Bf2,B^)’'^Bf2. = fin - finS"^ * 

J 3 31 0 0 0 0 
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T T — J 

Proof; Lot (x|y) = x y ami (x|y)^ " x y i = 0, 1, ... ,m - 1. 

For S S'*” and S'*'^ will donoto, respective! v, the orthoRona] 

complements of S relative to the inner products ( ) and ( -j* . 

*1 

If A Is an n n matrix A will denote the adjoint of A relative 
to the inner product ( '|* tn r". If A is a k x n matrix A will 
denote the adjoint op A relative to the inner products ^ *1' 
and ( 'I' ) on R^. It follows that B ^ . 

If B is a sufficient statistic for q then, according to 

Theorem 3., ker B c ker(n~^ - ; j 1» ... ,ra - 1 and hence 

I.J in 

(ker B) J » (ker B) ^ . Since this implies range (B •') => range (B we have 

*0 *n ~1 *•« *■< T T -1 T T “1 

that B (BB BB J = B and hence that J!jB (Bfi^B ) - flgB (BQgB ) 

which is (a) . 

Now let Q “ figB'^CBngB'*') ^B and observe that Q ^ *= Q - for 

j = 1, . . . ,m - 1 . It follows that ker Q = ker B c ker(n ^ - £2g^) and that 
^ 'k 

Q(fl ^ - S!„) ^ and hence that Q(n, - f2_) « J2, - fin which, 

jO jO jUju 

recalling the definition of Q, is equivalent to (c) . 

Since ker(fi“^ - fi“^) n ~ 

- Tig G (ker B)-^J = range (B = range (Q) , it follows that 
Q(Hj - tig) “ ~ ^0 recalling the definiton of Q, is equivalent 

to (b) . 


Since all of the preceeding arguments are reversible, (a), (b) and (c) 
imply B is a sufficient statistic for > completing the proof of 
the theorem. 

In the next theorem we will use the fact that there exists a non 
singular matrix M such that MT^qM = I and hence that the affine transform- 



9 


ation X '^Mx - Oq provides a nhnnr.o variables that allows (wlchout loss 
oC geaeralitiy or the ability tn reeover the sufficient statistic illative to 
the original variables) one tt ansume that « 0 and * !• 

Theorem 6. If ^ ^ then a necessary and sufficient condition 

that a k X n rank k matrix B be sufficient for (P. }? i Is that there 

1 1=0 

exist a rank k orthogonal projection Q such that, for 1=1, ... ,m - 1, 


(I “ Q)[niiri2l - I 1^2 " ••• 1^1 - 


where Z is the n x (n + 1) (m - 1) zero matrix. 


Proof; If B la a sufficient statistic for assume without 

T 

loss of generality that BB = I since B is a sufficient statistic for 

if and only if KB is a sufficient statistic for each nonsingular 
k X k matrix K. One may indeed choose K such that KBB^K^ = (KB)(KB)^= I 
For i = 1, ... ,m - 1 Theorem 5. implies that 

T T -1 T T “1 T 
) = IB (BIB ) =B 


BO that 


T -1 “It T t -1 T 

(Sn^B ) = BS2j B and (Bfl^B ) B = B B 


Right multiplication of the latter equation by fJ^B B will establish that 

T T T 

n.B R = B Bfi.B B 
1 1 


from whence it follows, using symmetry, that 
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Since ® 0 “nfl fij X> Theorem S- Turther Implies 

- B’‘n « 0 

and 

' j , - « I - b’*’b 

i 1 

T T + 

Since BB = I, it follows that B « B (where (•) denotes the 

_ T + 

generalized inverse of (•)) and hence tliat Q = BB»BB is the 

T 

orthogonal projection on the rang® of B [5]. Clearly Q has rank k and 

we conclude that 

(I - Q)nj, « 0 

and 

(I - ™ I) = Z 

and the condition follows. Conversely, if the conditon holds let B be any 

T i' 

k ^ n rank k matrix such that range (B ) = range (Q) . Clearly B B - Q, 

= I and B^ = Using the symmetry of I - Q and - I we conclude 

that 

Q-b’^b =» b’^bQ. 

i 1 

and hence that 

Q = b’^B = B'*'Bfi^B'^(Bn^B'^)'’^B =< 

= P^B'^(Bn^B'^)~^J . 

In addition, 

'P T — 1 T 

) =» B 

The obvious substitution for Q guarantees the satisfaction of the 


conditions of Theorem 5 . 
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Definition 1. We will nny that a rank k orthogonal projection Q 
generates a nuffieif'nt statistic for provltled Q satisfies 

the condition in Theorem 6. 

Corollary 1. If [hilnol •••In , [ ~ I [ . • . - I] then 

^ '^12 m-1 1 m-1 

a) Q “ MM^ generates a sufficient statistic for (P, }? i 

t i=U 

and 

*4* 

b) k =• rank (MM ) = tr (MM ) is the smallest integer for which 

there exists a rank k orthogonal projection generating a 

sufflrl^cnt statistic for {P. }? i 

1 i=0 

Proof: Let k be the smallest integer for which there exists a rank k 

orthogonal projection P generi^ting a sufficient statistic for • 

According to the definition of M, (I - P)M =■ E so that PM = M 
and PMM"^ = MM"*^ . Since (I - MM^)M = E , MM^ generates a sufficient 

statistic for ' however, PMM = I'Dl implies timt range 

+ 'h 

(MM ) c range (P) so that the minimality of k and the fact that MM is 

an orthogonal projection imply that range (MM^) = range (P) and hence that 

mm"*" = P. 

Corollary 2. If B is a sufficient statistic for then 

i = 0, 1, . . . ,m - 1 c 

Proof; The conclusion is an immediate consequence of line 6 in the proof 


of Theorem 6. 
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4. Concluding Rtmarks, Theorems 4 and 5, althougli not so stated, are 

valid for arbitrary families of n-variate normal probability measures. 

Corollary 1, formally gives the construction for a sufficient statistic 

for finite families of n-varlate normal probability measures solely in 

tenns of tl)e known parameters that determine the densities. In fact, if 

4* 

k=rank (H) (“rank MM ) then any rank k matrix B for which range (B)“range (M) 
is a sufficient statistic for the family. Moreover, in terms of the 
dimension of the range of a sufficient statistic, k^rank M is the smallest 
integer for which there exists a sufficient statistic. 

Several open questions concerning the "appropriate" definition of a 
"almost" sufficient statistic using the characterizations given in 
Theorems 4. and 5. will be the subject of a later paper. In this connection 
the results of Le Cam [4], although the approach is different, should be of 
significant value. 

5, Acknowledgement . The authors would like to express there sincere 
appreciation to Professor H. Elton Lacey for his comments. 
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